Accession Number |
TCMCG075C06008 |
gbkey |
CDS |
Protein Id |
XP_007042972.2 |
Location |
complement(join(7217819..7217913,7218574..7218655,7218744..7218941,7219393..7219846,7220111..7221000)) |
Gene |
LOC18608297 |
GeneID |
18608297 |
Organism |
Theobroma cacao |
Length |
572aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007042910.2 |
Definition |
PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Theobroma cacao] |
CDS: ATGGAAGTTGATGATGACAACAGTCCACCAACTTCATCTGCAGAACCTTCAGTTGCAGTTCCCGACAGCCAAACCACAGTGCCTACGGCAAATAACACTCTACTTCAGCCTGTTCAGCCTATTGTTCCAGCAGTGGTCCCGCCTGCAGTTGTACCACCTATTGCTCCGCTACCTGCCATTCCTCCAGTCCCTGTTGTGCATCCATTGGCACCTCTCCCGATTCGCGCTCCCATCCTTAAACCACTTGCACCTCAAAATGGTGAGGTGAAAACTAGTGATTCAGACTCTGACCATGAGGATGAAGGGCGAACTGCTGCTGTTGATTATGAGATCTCAGAAGAGAGTAGACTGGTTAGAGAGCGGCAAGAAAAGGCCATGCAAGAACTTCTGATGAAACGACGTGCTGCTGCACTAGCAGTGCCTACTAATGACATGGCTGTCCGGACTCGACTTCGCCGGCTTGGTGAACCCATAACTCTTTTTGGAGAAAGAGAGATGGAAAGGCGGGATAGGCTGCGAATGATTATGGCAAAGCTGGATTCTGAGGGGCAGTTGGAGAAGTTGATGAAGGCGCATGAGGAGGAAGAGGCTGCAGTTTCTGCTAAAATGGAGGACGTTGAGGAAGACATTCAATATCCCTTTTATACTGAGGGTCCAAACGAGCTCTTGGATGCTAGAATTGATATTGCAAAGTACTCTGTTGTAAAGGCAGCTATGCGTGTTCAACGTGCACAGAGAAAAAGGGATGATCCAGATGAAGATATGGATGCTGAAACTGATTGGGCTCTAAGGCAGGCAGGGAATTTGGTTCTTGATTGCAGTGAAATTGGGGATGATAGACCACTTTCCGGTTGTTCTTTCTCACGTGATGGACAACTTCTTGCCACCTGCTCATTGAGTGGAGTTGCTAAGTTGTGGTCAATGCCTAGGGTAAGTAAGGTTTCTGCCTTAAAGGGCCACACAGAACGTGCAACTGATGTTACATTTTCTCCTGTGCATGATCATTTAGCGACTGCTTCTGCTGACCGAACAGCAAAGTTGTGGAACACTGATGGATCACTCCTGACAACATTTGAGGGCCATTTGGATCGCCTTGCACGCATAGCCTTCCATCCTTCAGGGAAGTACCTTGGCACAACAAGCTTTGATAAAACATGGAGACTGTGGGACATAGACAGTGGTGTAGAGTTGCTTCTCCAAGAAGGTCATAGTAGGAGTGTCTATGGAATTGCGTTCCACCAAGATGGATCTTTAGCAGCATCCTGTGGACTTGATGCACTTGCTCGTGTTTGGGATCTTCGCACTGGTAGAAGTGTTCTTGCTTTGGAAGGCCATGTCAAGCCAGTTCTTGGTGTGAGTTTTTCACCCAATGGCTACCATTTAGCTACAGGAGGTGAAGATAATACCTGTCGAATATGGGATTTGAGGAAGAAAAAATCCCTCTACATCATACCAGCCCACTCAAATCTTATATCACAAGTGAAGTTTGAGCCTCAAGAGGGATATTACTTGGTCACTGCTTCTTATGACATGACTGCAAAGGTTTGGTCCGGCAGAGATTTTAAGCCTGTCAAAAGCTTACCAGGTCATGAAGCTAAAGTCACTGCTTCAGATATTAGTGAAGATAGCCGGTATATTGTGACTGTCTCTCATGATCGAACAATAAAGCTGTGGACTGCTGGTAACATAGGAAAGGAAAAAGATATGGATTTGGACTGA |
Protein: MEVDDDNSPPTSSAEPSVAVPDSQTTVPTANNTLLQPVQPIVPAVVPPAVVPPIAPLPAIPPVPVVHPLAPLPIRAPILKPLAPQNGEVKTSDSDSDHEDEGRTAAVDYEISEESRLVRERQEKAMQELLMKRRAAALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRMIMAKLDSEGQLEKLMKAHEEEEAAVSAKMEDVEEDIQYPFYTEGPNELLDARIDIAKYSVVKAAMRVQRAQRKRDDPDEDMDAETDWALRQAGNLVLDCSEIGDDRPLSGCSFSRDGQLLATCSLSGVAKLWSMPRVSKVSALKGHTERATDVTFSPVHDHLATASADRTAKLWNTDGSLLTTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDIDSGVELLLQEGHSRSVYGIAFHQDGSLAASCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLISQVKFEPQEGYYLVTASYDMTAKVWSGRDFKPVKSLPGHEAKVTASDISEDSRYIVTVSHDRTIKLWTAGNIGKEKDMDLD |
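The Location and Length fields can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the Location value follows the usual GenBank-style `complement(join(start..end,...))` syntax with 1-based inclusive coordinates, and the helper names `parse_join_location` and `spliced_length` are hypothetical. It sums the five joined segments and compares the result with the reported protein length of 572 aa plus one stop codon (the CDS above ends in TGA).

```python
import re

# Location string and protein length copied from the record above.
LOCATION = (
    "complement(join(7217819..7217913,7218574..7218655,"
    "7218744..7218941,7219393..7219846,7220111..7221000))"
)
PROTEIN_LENGTH = 572  # from the Length field ("572aa")


def parse_join_location(location):
    """Return (is_complement, [(start, end), ...]) for a GenBank-style location.

    Assumption: coordinates are 1-based and inclusive, and every segment
    is written as start..end.
    """
    is_complement = location.strip().startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments


def spliced_length(segments):
    """Total number of nucleotides covered by the joined segments."""
    return sum(end - start + 1 for start, end in segments)


is_complement, segments = parse_join_location(LOCATION)
cds_nt = spliced_length(segments)

print(is_complement, len(segments), cds_nt)  # True 5 1719

# 572 codons for the protein plus 1 stop codon, 3 nt each.
assert cds_nt == 3 * (PROTEIN_LENGTH + 1)
```

The segment lengths are 95, 82, 198, 454 and 890 nt, giving 1719 nt in total, which matches 573 codons (572 amino acids plus the stop codon). The `complement` wrapper indicates that the feature lies on the reverse strand, so the segments are read in reverse order when the CDS is assembled.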